Speech Recognition for Keyword Spotting using a Set of Modulation Based Features – Preliminary Results
نویسندگان
چکیده
We present the preliminary results of applying a set of parameters of the AM-FM model for recognizing word utterances. By acquiring modulation based parameters from the amplitude envelope (AE) and the instantaneous frequency – both obtained by demodulating at four selected center frequencies – a compact feature set is created for each frame of a word utterance. Applying a dynamic time warping of features, a dissimilarity measure between an unknown and one of several reference utterances is obtained to detect the presence of a keyword in a continuous stream of speech. A feature set consisting of the peak frequencies in AE and weighted formants, among others, shows an overall recognition score of 75 percent or higher – depending on the analysis frequencies used – for an extracted set of word utterances. The low false positive and false negative scores suggest the viability of modulation based parameters for building a keyword spotting system.
منابع مشابه
Keyword Spotting Based On Decision Fusion
Automatic speech recognition (ASR) technology is available now-a-days in all handsets where keyword spotting plays a vital role. Keyword spotting performance significantly degrades when applied to real-world environment due to background noise. As visual features are not affected much by noise this provides better solution. In this paper, audio-visual integration is proposed which combines audi...
متن کاملRobust Keyword Spotting Using a Multi-Stream Approach
Speech recognition systems are prone to severe degradation in noisy environments due to mismatch between training and testing conditions. A multi-stream approach for keyword spotting is proposed to improve robustness in mismatched conditions. The assumption is that most real world noises are colored and do not affect the full spectrum equally, meaning certain parts of the spectrum can still pro...
متن کاملComparison of keyword spotting methods for searching in speech
This paper presents and discusses keyword spotting methods for searching in speech. In contrast with searching in text, the searching in speech or generally in multimedia data still represents a challenge. The aim of the paper is to present a keyword spotting (KWS) method based on a large vocabulary continuous speech recognition (LVCSR) system, based on phonetics decoder, and keyword spotting u...
متن کاملAn Utterance Recognition Technique for Keyword Spotting by Fusion of Bark Energy and MFCC Features
This paper describes the preliminary results of a keyword spotting system using a fusion of spectral and cepstral features. Spectral energy in 16 bands of frequencies on Bark scale and 16 mel-scale warped cepstral coefficients are used independently and in combination with appropriate weights for recognizing word utterances. Results of matching features using Euclidean and cosine distances in a...
متن کاملAn Effective Approach for Chinese Speech Recognition on Small size of Vocabulary
In this paper, an effective approach for Chinese speech recognition on small vocabulary size is proposed the independent speech recognition of Chinese words based on Hidden Markov Model (HMM). The features of speech words are generated by sub-syllable of Chinese characters. Total 640 speech samples are recorded by 4 native males and 4 females with frequently speaking ability. The preliminary re...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010